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P. gingivalis sequences 
FIELD OF THE INVENTION 

5 The present invention relates to P, gingivalis nucleotide sequences, P. 

gingivalis polypeptides and probes for detection of P. gingivalis, 

BACKGROUND OF THE INVENTION 

10 Periodontal diseases are bacterial-associated inflammatory diseases of 

the supporting tissues of the teeth and range from the relatively mild form of 
gingivitis, the non-specific, reversible inflammation of gingival tissue to the 
more aggressive forms of periodontitis which are characterised by the 
destruction of the tooth's supporting structures. Periodontitis is associated 

15 with a subgingival infection of a consortium of specific Gram-negative 
bacteria that leads to the destruction of the periodontium and is a major 
public health problem. One bacterium that has attracted considerable 
interest is P. gingivalis as the recovery of this microorganism from adult 
periodontitis lesions can be up to 50% of the subgingival anaerobically 

20 cultivable flora, whereas P. gingivalis is rarely recovered, and then in low 
numbers, from healthy sites. A proportional increase in the level of 
P. gingivalis in subgingival plaque has been associated with an increased 
severity of periodontitis and eradication of the microorganism from the 
cultivable subgingival microbial population is accompanied by resolution of 

25 the disease. The progression of periodontitis lesions in non-human primates 
has been demonstrated with the subgingival implantation of P. gingivalis. 
These findings in both animals and humans suggest a major role for P, 
gingivalis in the development of adult periodontitis. 

P. gingivalis is a black-pigmented, anaerobic, asaccharolytic, 

30 proteolytic Gram-negative rod that obtains energy from the metabolism of 
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specific amino acids. The microorganism has an absolute growth 
requirement for iron, preferentially in the form of haeme or its Fe(III) 
oxidation product haemin and when grown under conditions of excess 
haemin is highly virulent in experimental animals. A number of virulence 
5 factors have been implicated in the pathogenicity of R gingivalis including 
the capsule, adhesins, cytotoxins and extracellular hydrolytic enzymes. 

In order to develop an efficacious and safe vaccine to prevent 
P. gingivalis colonisation it is necessary to identify and produce antigens that 
are involved in virulence that have utility as inununogens to generate 

10 neutralising antibodies. Whilst it is possible to attempt to isolate antigens 
directly from cultures of P. gingivalis this is often difficult. For example as 
mentioned above, P. gingivalis is a strict anaerobe and can be difficult to 
isolate and grow. It is also known that, for a number of organisms, when 
cultured in vitro that many virulence genes are down regulated and the 

15 encoded proteins are no longer expressed. If conventional chemistry 

techniques were applied to purify vaccine candidates potentially important 
(protective) molecules may not be identified. With DNA sequencing, as the 
gene is present (but not transcribed) even when the organism is grown in 
vitro it can be identified, cloned and produced as a recombinant DNA 

20 protein. Similarly, a protective antigen or therapeutic target may be 

transiently expressed by the organism in vitro or produced in low levels 
making the identification of these molecules extremely difficult by 
conventional methods. 

With serological identification of therapeutic targets one is limited to 

25 those responses which are detectable using standard methods such as 
Western Blotting or EUSA. The hmitation here is the both the level of 
response that is generated by the animal or human and determining whether 
this response is protective, damaging or irrelevant No such limitation is 
present with a sequencing approach to the identification of potential 

30 therapeutic or prophylactic targets. 
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It is also well known that P. gingivalis produces a range of broadly 
active proteases (University of Melbourne International Patent Application 
No PCT /AU 96/00673, US Patent Nos 5,475,097 and 5,523,390), which make 
the identification of intact proteins difficult because of their degradation by 
5 these proteases. 

SUMMARY OF THE INVENTION 

The present inventors have attempted to isolate P. gingivalis 

10 nucleotide sequences which can be used for recombinant production of 
P, gingivalis polypeptides and to develop nucleotide probes specific for 
P. gingivalis. The DNA sequences listed below have been selected from a 
large number of P. gingivalis sequences according to their indicative 
potential as vaccine candidates. This intuitive step involved comparison of 

15 the deduced protein sequence from the P. gingivalis DNA sequences to the 
known protein sequence databases. Some of the characteristics used to 
select useful vaccine candidates include; the expected cellular location, such 
as outer membrane proteins or secreted proteins, particular functional 
activities of similar proteins such as those with an enzymatic or proteolytic 

20 activity, proteins involved in essential metabolic pathways that when 

inactivated or blocked may be deleterious or lethal to the organism, proteins 
that might be expected to play a role in the pathogenesis of the organism 
eg. red cell lysis, cell agglutination or cell receptors and proteins which are 
paralogues to proteins with proven vaccine efficacy. DNA sequences that 

25 were considered to be poor vaccine candidates and not selected include 
those that code for proteins involved in replication, non-essential proteins 
involved in cellular processes and those proteins present at sites that would 
be unlikely to be affected by immune mediators such as those found in the 
bacterial cytoplasm or inner membranes. 



In a first aspect the present invention consists in an isolated 
P. gingivalis nucleotide sequence, the nucleotide sequence consisting of or 
including a sequence selected from the group consisting of SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 
fragments thereof and sequences complementary thereto. 

In a second aspect the present invention consists in an isolated 
P. gingivalis polypeptide, the polypeptide being at least partially encoded by 
a nucleotide consisting of or including a sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, 
SEQ ID NO: 5, SEQ ID NO: 6, fragments thereof and sequences 
complementary thereto. 

In a third aspect the present invention consists in an isolated 
P, gingivalis polypeptide consisting of or including an amino acid sequence 
selected from the group consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID 
NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ 
ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18 
or fragments thereof. 

In a fourth aspect the present invention consists in an isolated 
polynucleotide, the polynucleotide encoding a polypeptide consisting of or 
including an amino acid sequence selected from the group consisting of SEQ 
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ 
ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, 
SEQ ID NO: 17, SEQ ID NO: 18 or fragments thereof. 

In a fourth aspect the present invention consists in a nucleotide probe 
specific for P. gingivalis, the probe including a detectable label and a 
nucleotide sequence of at least 15 nucleotides, the nucleotide sequence 
being derived from a sequence selected from the group consisting SEQ ID 
NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO: 6 and sequences complementary thereto. 
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In a fifth aspect the present invention consists in a composition for 
use in raising an immune response in an animal directed against P. 
Gingivolis the composition including an acceptable carrier and/or adjuvant 
and at least one polypeptide having a sequence selected from the group 
5 consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, 
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 
15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18 or fragments thereof. 

As will be understood by those skilled in the art the nucleotides of the 
present invention may be useful in DNA vaccination to reduce the incidence 
10 and/or severity of P. gingivolis infection. 

Accordingly in a sixth aspect the present invention consists in a 
composition for use in inducing an immune response, the composition 
including at least one DNA molecule, the at least one DNA molecule having 
or including a sequence selected from the group consisting of SEQ ID NO: 1, 
15 SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 
fragments thereof and sequences complementary thereto. 

Further information regarding DNA vaccination may be found in 
Donnelly et al. Journal of Immunological Methods 176(1994) 145-152, the 
disclosure of which is incorporated herein by reference. 
20 Throughout this specification, unless the context requires otherwise, 

the word "comprise", or variations such as "comprises" or "comprising", will 
be understood to imply the inclusion of a stated element or integer or group 
of elements or integers but not the exclusion of any other element or integer 
or group of elements or integers. 

25 

DETAILED DESCRIPTION 

Preparation of the P. gingivolis library for sequencing. 
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To determine the DNA sequence of P. gingivalis genomic DNA was isolated 
from P. gingivalis strain W50 (ATCC 53978) essentially by the method 
described by Mamur J. (1961)(1). Cloning of DNA fragments was performed 
essentially as described by Fleischmann et al,, (1995)(2). Briefly, purified 
5 genomic DNA from R gingivalis was nebulized to fragment the DNA and 
was treated with Bal31 nuclease to create blunt ends then run twice on 
preparative 1% agarose gels. DNA fragments of 1.6-2.0 kb were excised from 
the gel and the DNA recovered. This DNA was then ligated to the vector 
pUCl8 [Smal digested and dephosphorylated; Pharmacia) and 

10 electrophoresed on a 1% agarose preparative gel. The fragment comprising 
linear vector plus one insert was excised, purified and this process repeated 
to reduce any vector without insert contamination. The recovered vector 
plus insert DNA was blunt-ended with T4 DNA polymerase, then a final 
ligation to produce circular DNA was performed. Aliquots of Epicurian Coli 

15 Electroporation-Competent Cells (Stratagene) were transformed with the 
library DNA and plated out on SOB agar antibiotic diffusion plates 
containing X-gal and incubated at SZ^'C overnight. Colonies with inserts 
appeared white and those without inserts (vector alone) appeared blue. 
Plates were stored at 4°C until the white clones were picked and expanded 

20 for the extraction of plasmid DNA for sequencing. 

DNA sequencing 

Plasmid DNA was prepared by picking bacterial colonies into 1.5 ml of 
LB, TB or SOB broth supplemented with 50-lOOug/ml Ampicillin in 96 deep 
25 well plates. Plasmid DNA was isolated using the QIAprep Spin or QIAprep 
96 Turbo miniprep kits (QIAGEN GmbH, Germany). DNA was eluted into a 
96 well gridded array and stored at -20C. 

Sequencing reactions were performed using ABI PRISM Dye 
Terminator and ABI PRISM BIGDye Terminator Cycle Sequencing Ready 
30 Reaction kits with AmpliTaq DNA polymerase FS (PE Applied Biosystems, 
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Foster City, CA) using the Ml 3 Universal forward and reverse sequencing 
primers. Sequence reactions were conducted on either a Perkin-Elmer 
Gene Amp 9700 (PE Applied Biosys terns) or Hybaid PGR Express (Hybaid, 
UK) thermal cyclers. Sequencing reactions were analysed on ABI PRISM 
5 377 DNA sequencers (PE Applied Biosystems). 

The sequences obtained are set out below. 

SeqID#l-9 represents the DNA sequence that encodes proteins in 
SeqID#10-27. Proteins in SeqID#10-18 are the entire open reading frame 
from DNA SeqID#l-9. Proteins in SeqID#19-27 are the proteins encoded by 

10 DNA SeqID#l-9 from their putative intitiation codon. The initiation codon 
was calculated from sequence homology alignment using FastX or by the 
ORF prediction program GeneMark. 

As will be understood by those skilled in the art open reading frames 
(ORFs) may be readily identified. ORFs may be determined using two 

15 methods, for example, alignments from FastX search results may be used to 
define the start and end positions of coding regions if sufficient protein 
homology is present. Alternatively, protein coding regions may be identified 
using the ORF identification progam GeneMark (3) using a training matrix 
based on published P.gingivalis sequences. This matrix may be further 

20 refined by adding ORFs identified from the results of homology searching 

and ORFs identified by GeneMark. The program PSORT (4) may be used for 
the detection of signal sequences and the prediction of protein cell 
localisation. A UNDC version of TopPred (5) may be used to identify 
potential membrane spanning domains. 

25 

DNA sequence analysis 

Raw trace data files from the ABI 377 sequencer were manually 
trimmed using Staden Pregap(Laboratory of Molecular Biology, Medical 
Research Council, UK) running on a Sun Microsystem computer. Trimmed 
30 files were assembled into contigs using Staden Gap v4.1 and exported as a 
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consensus file in FastA format. This consensus was converted into GCG 
format files and analysed for homology using the FASTX algorithm on a non- 
redundant protein database compiled by ANGIS (Australian Genomic 
Information Service, University of Sydney). Individual FASTX search 
5 results were examined for significant homology by statistical probability and 
amino acid alignments. 

The results are set out in Table 1. 
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(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: miscf eature 

(B) LOCATION 1...960 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 



60 



ccgcaatcct ctcctgatcg aagaagcttc caaaacgtca tgaataaata tcattctcaa 
agcgttttag aggtcggcaa aattgggatt gtgattatct ttgcgcccat agtacggaat 120 
gtacatcaac aacccccttt tttaagccat aaatcaatta tgcgtattgt cagtaatttt 180 
ttgttcgtct ctttttcggt tttgcttttt gcatcatgcc gttcccagcg agaaaaggtc 2 40 

gtttacctgc aagatatcca aacttttaat cgggagatta tcgctaaacc atatgacgta 
aaaattgaga aggacgatgt gctgaacatc cttgtcagca gtagagaccc ggagctttca 
acgccctaca accaagtgtt gaccactcgt gcactggccc gcaacggcta tggaacgaac 
tcgaacgaag gcttcctggt cgattcgaaa gggtacatca attatcctat tttaggccag 
atctatgtag agggccttac tcgtaccgaa ctggagaagg agatacagaa gaggattatt 
tccagtggat ttatcaagga tcctacggta acggtgcagc ttcaaaattt caaggtgtcg 
gttttgggag aggtgaatca tccgggttcg atgtcggtaa aaggagagcg aataactctt 
ttggaagcga tcggaatggc cggagacctg acaatctatg gtcgccgcga tcgggttttt 
gtgattagag aaaccgatgg gcatcgcgag gttttccaga cggatctcag aaaggccgac 
ttgctcgcaa gccccgtgta ctatctgcat cagaacgacg tcatctatgt ggagccgaac 
gacaagaaaa cacagatgag cgagatcaac cagaataata acgtaaacgt atggctgagt 
gttacctcca ctttggtatc catttccacg ctgacgatta cgataataga taagaccaaa 
960 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



(2) INFORMATION FOR SEQ ID NO: 2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 
{ D ) TOPOLOGY : circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 
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(B) LOCATION 1...1338 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



aacccaaaaa aagcccttgc acttaatctg aaaggaaaac cgctgcccga gatgctggcc 
qaaccggccc aaagtcctac ttacgcggtc gtgcccgacg attttgaagg tgttatcccc 
aaggtgacgg ctcgtccggg ggataaggtg cgtgccggct cagcactgat gcaccacaag 
qcatatccgg agatgaagtt tacaagtccg gttagcggcg aagtgatcgc ggtgaatcgc 
ggtgccaagc gcaaggtgtt gagcatcgag gtgaaaccgg acggactgaa cgaatacgag 
tcattccctg tcggggatcc gtctgccctc tctgccgaac agatcaagga gcttttactg 
tcgagcggta tgtggggttt tattaagcaa cgtccttacg acatagtggc tacaccggat 
atagctccac gcgacattta tattactgcc aactttactg caccattggc tccggacttc 
gatttcatcg ttcgaggaga agaacgcgcc ctgcagactg ccatcgatgc cttggccaaa 
ctcacgacag gaaaggtgta tgtgggcctg aagccgggtt catctctggg cttgcacaat 
gcagaaatcg tagaagtaca cggacctcat ccggcaggta acgtgggcgt gctgatcaat 
catacgaagc caatcaatcg gggcgaaacg gtgtggacgc tcaaggctac cgacctgatc 
gtgatcggac gtttcctgct tacgggcaaa gccgatttta ccagaatgat tgccatgacc 
qgctcagacg ctgcagctca cggatacgtc cgtattatgc cgggttgcaa tgtctttgct 
tccttccccg gccgactgac aataaaggaa tctcacgagc gtgtgatcga tggcaatgtg 
ctgaccggta agaagctctg cgagaaggag cctttcctgt cagcccggtg tgaccagatc 
acggtgatcc ccgaaggcga cgatgtggac gaactcttcg ggtgggctgc accccgtctc 
gatcagtaca gcatgagcag agcttatttc tcttggttgc aggggaaaaa caaagagtac 
gtactcgatg cccggatcaa gggtggcgaa cgtgctatga tcatgagcaa cgagtatgac 
cgcgttttcc cgatggacat ctatccggag tatttgctca aggctattat agcattcgac 
atcgacaaga tggaggactt aggcatatat gaagtggctc cggaggactt tgccacttgc 
gaatttgtgg atacatccaa gatcgagctg cagcgtatcg ttcgcgaggg cttggatatg 
ctctataagg aaatgaat 

(2) INFORMATION FOR SEQ ID NO : 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1. . .987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

aaccttagga cacagccttt ctttttggta gatttgcaat ctatgatcag aacgatactt 
tcacgatatg tatcctcgaa cttttggagt cggggagcta cctttttttt cacgattttc 
ccggccttca tcctcgccgc tactgctttg ccggcttgtg gagggggtac tgcttcaggc 
tccgatcgta cgctggctgt gaccatcgag ccacagaaat acttcatcga gtccattgcg 
gataagtcgg tgcaggtggt ggcattggta ccggccggca gcaatccgga ggaatacgac 
ccttcgccta ccgtgatgaa gcgtttgtcc gaagcagatg cctacttcta tataggagga 
ctggggttcg agcaaagaaa tctcgctgcc attcgggaca ataaccctaa gctccctctt 
ttcgaaatgg gcaaagcctt ggcggatgcc ggaagtgcag atctccacgg ctcctgcaca 
gatcattctc atacagacct gcatgcccat gatccgcact attggagcag tgtggtaggg 
gcaaaggcac tcagtcgtgc tgcatacgac gcgcttgtgg agctttatcc gaacgagaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1338 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 




gacaaatggg 
gtcgatacca 
ctcagctttt 
gagcctacgg 
atcgtattta 
ggtgctcgtc 
attgctcgcg 



acaaagggca 
tgtttgccaa 
tcgcccaaga 
ctgcccacct 
tccaacccga 
cggtaaggat 
ctttggctca 



cgaccgtctc 
tggcaaagca 
gttcggcctg 
tcgtcgtgtg 
atttgaaacg 
caatcctctg 
tgaacgg 
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aacggacgta 
gacaaagcct 
cggcagatcg 
atcgatcagg 
cgtcaggcgg 
cgcagctcgt 



tcgacagcgt 
tcgtcatata 
tcatagagga 
cacgtgccga 
aggacatcgc 
gggaggagga 



gaagagactc 
tcacccatcg 
agatgggaaa 
tggtgtcaga 
acgcgagatc 
aattttacat 



660 
720 
780 
840 
900 
960 
987 



(2) INFORMATION FOR SEQ ID NO : 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : double 
( D ) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival! s 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...1095 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 



60 
120 
180 
240 
300 



tgctgcgcaa ggcttcggga cacctctccc cggaagaagt cgtttcggaa tgataccgtt 
tttccttatt tcgttattca tctgatcaaa catattatca ttatgacgga caacaaacaa 
cgtaatatcg tattcccggc gtttctcctc ttgctgggag tcatcgcagt ggtgacgatc 
gttggttttt tcatgctcag accggccgag gagattatcc aaggacagat agaagtgacc 
gaataccgag tgtccagcaa agtgcccggg cgcatcaagg aacttagggt atccgaggga 
cagcaggtgc aggccggcga taccctcgct gtcatcgaag cccccgacgt agcggctaag 360 
atggagcagg caaaggctgc cgaagcagct gcacaggctc agaacgccaa ggctctcaaa 420 
ggagcacgca gcgaacagat acaggcagcc tatgagatgt ggcagaaagc tcaggccggc 480 
gtagccatag cgaccaagac acaccagcgc gtgcagaacc tctatgacca gggagtggta 540 
ccggctcaga agttggacga agccactgcc cagcgcgatg cggccatcgc tacgcaaaaa 600 
gcggccgaag cccagtacaa tatggctcgc aacggtgccg aacgcgaaga caagctggca 660 
gcttctgccc tcgtcgatag agcgagagga gccgtcgccg aggtggagtc gtacatcaac 720 
gaaacctacc tcatcgcccc acgggcaggc gaagtgtcgg agatattccc caaagccggc 
gaactcgtag gtaccggcgc acctatcatg aatatcgccg agatgggcga tatgtgggcc 
agctttgccg ttcgtgagga tttcctcagc agcatgacca tgggagccgt tctggagact 
gtggtgccgg ctctgaatga agaaaaagta cgcttcaaga tcacattcat caagaacatg 
ggtacctatg ctgcctggaa agcgaccaag acaacagggc agtacgacct gaagaccttc 1020 
gaggtaaagg ccacccttgc ggataaagac aaggcacaaa agctacgccc gggtatgtcc 1080 
gtgatcatac gcaag 1095 



780 
840 
900 
960 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (g nomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1..-693 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



acgaat:aaaa 
gctctcgtgt 
ggagtaggag 
gccatcgtcg 
aagcagaaaa 
ggagaggcta 
ctgagtccca 
gacacggata 
cct-ctgtctg 
atgtcgcgca 
gttgccggac 
atcgaacaag 



aagaagagac 
tcgccggttg 
gtgccattgg 
gtactgcagt 
aagaactgga 
ttctggttac 
actcacgcac 
ttcgtatcgt 
agagacgtgc 
tggcagccga 
gttcggccaa 
cacagcaagg 



aatgaagaaa 
cggactgaac 
tgccggagta 
cggtggagca 
ggccgcagta 
tttcgatagc 
tgcgctgacg 
aggccatacg 
agccagcgta 
agggcgtggg 
ccgccgtgtg 
tacgctgaag 



tcaagtgtag 
aatatggcaa 
ggtaacgtag 
gccggtgctc 
cccgatgcta 
ggtatcctct 
aagtttgctg 
gacaataccg 
tattctttcc 
agccatgaac 
gaggtttata 
taa 



tagcctcagt 
aaggcggcct 
ccggaaatac 
tcatcggaaa 
cgattcagac 
ttgcgacgaa 
caaacatgaa 
gctccgacaa 
tgaattctca 
cggttgcaga 
tcttgccgaa 



tttggccgtg 
tatcggcgcc 
ggctgtcggt 
gaagatggac 
agtaaatgac 
ctccagcact 
caaaaacccc 
gatcaacgat 
gggtgtgagt 
caatagcaca 
tgccaagatg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
693 



(2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1.,.1506 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 



60 



gaaggggcca gcttcgtcct tgtcctcacc ggcaagggct tcgtgccggc tcgctcttgg 

gagcgcgtga acgaggctct cgaaaaagac gaaatcgtaa agggctatgt gaagtgtcgt 120 

accaagggtg gtatgatcgt cgatgtattc ggtatcgagg ctttcctccc gggatcacag 180 

atcgacgtgc gccccattcg cgactacgat gcattcgttg agaagacgat ggagttcaag 240 

attgtgaaaa tcaatcaaga atataagaat gtagttgttt cccacaaggt gctcatcgaa 300 

gcagagctcg aacaacagaa gaaagaaatc atcggcaagc tcgaaaaagg gcaggtactc 360 

gaaggtatcg tcaagaatat tacttcttac ggagtattta tcgacctcgg tggagtggat 420 

ggtcttatcc atatcactga cctttcatgg ggtcgtgtgg ctcatccgga agaaatcgta 480 

cagctggatc agaagatcaa tgtcgttatc ctcgactttg atgaagatcg caagcgtatc 540 

gctctcggac tcaaacagct gatgcctcat ccttgggatg ctctcgacag cgagcttaag 600 

gtaggcgata aggtgaaggg taaagttgtg gtgatggcag attacggtgc tttcgttgag 660 
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littgcacagg gcgttgaggg tcttatccac gtaagcgaaa tgtcatggac acagcacttg 720 

cgttctgctc aggacttcct gcatgtaggc gacgaagtgg aagccgtgat cctgacgctc 780 

gaccgcgaag aacgcaaaat gtcgctcggt ctgaagcaac tcaagccgga tccttgggct 840 

gatatcgaaa ctcgtttccc tgtaggctct cgtcaccatg ctcgtgttcg caacttcacc 900 

aatttcggtg tattcgttga gatcgaagag ggcgtagatg gccttatcca tatttccgac 960 

ctttcttgga cgaagaagat caaacacccc agcgagttta cggaagtagg tgctgatatc 1020 

gaagttcagg taatcgagat cgacaaggaa aaccgtcgtc tcagcttggg tcacaaacag 1080 

ttggaagaga atccttggga tgtattcgag acggtattca ctgtaggatc tatccacgaa 1140 

ggaacggtaa tcgaagtgat ggacaagggt gctgtcgttt ctctgcctta cggtgtggaa 1200 

ggttttgcca ctccgaagca catggtgaag gaagatggct cacaggctgt actcgaagag 1260 

aagttacctt tcaaggttat tgagttcaat aaggatgcca agcgaatcat tgtatctcat 1320 

agccgtgtat tcgaagatga gcagaaaatg gctcagcgtg aagccaatgc agagcgtaag 1380 

gctgaagcca aagcggctca gaaagaagct gctgccgaag ctgccaatcc tgcacaggct 1440 

gtagagaaag ccactctcgg agacctcggc gagctggccg ctttgaaaga aaagctttca 1500 

gaaaac 1506 



(2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAME/KEY: miscfeature 

(B) LOCATION 1,..320 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



Pro 


Gin 


Ser 


Ser 


Pro 


Asp 


Arg 


Arg 


Ser 


Phe 


Gin 


Asn 


Val 


Met 


Asn 


Lys 


1 








5 










10 










15 




Tyr 


His 


Ser 


Gin 


Ser 


Val 


Leu 


Glu 


Val 


Gly 


Lys 


He 


Gly 


He 


Val 


He 








20 










25 










30 






He 


Phe 


Ala 


Pro 


He 


Val 


Arg 


Asn 


Val 


His 


Gin 


Gin 


Pro 


Pro 


Phe 


Leu 






35 










40 










45 








Ser 


His 


Lys 


Ser 


He 


Met 


Arg 


He 


Val 


Ser 


Asn 


Phe 


Leu 


Phe 


Val 


Ser 




50 










55 










60 










Phe 


Ser 


Val 


Leu 


Leu 


Phe 


Ala 


Ser 


Cys 


Arg 


Ser 


Gin 


Arg 


Glu 


Lys 


Val 


65 










70 










75 










80 


Val 


Tyr 


Leu 


Gin 


Asp 


He 


Gin 


Thr 


Phe 


Asn 


Arg 


Glu 


He 


He 


Ala 


Lys 










85 










90 










95 




Pro 


Tyr 


Asp 


Val 


Lys 


He 


Glu 


Lys 


Asp 


Asp 


Val 


Leu 


Asn 


He 


Leu 


Val 








100 










105 










110 






Ser 


Ser 


Arg 


Asp 


Pro 


Glu 


Leu 


Ser 


Thr 


Pro 


Tyr 


Asn 


Gin 


Val 


Leu 


Thr 






115 










120 










125 








Thr 


Arg 


Ala 


Leu 


Ala 


Arg 


Asn 


Gly 


Tyr 


Gly 


Thr 


Asn 


Ser 


Asn 


Glu 


Gly 




130 










135 










140 










Phe 


Leu 


Val 


Asp 


Ser 


Lys 


Gly 


Tyr 


11 


Asn 


Tyr 


Pro 


He 


Leu Gly Gin 


145 










150 










155 










160 


He 


Tyr 


Val 


Glu 


Gly 


Leu 


Thr 


Arg 


Thr 


Glu 


Leu 


Glu 


Lys 


Glu 


He 


Gin 










165 










170 










175 




Lys 


Arg 


He 


He 


S r 


Ser 


Gly 


Phe 


He 


Lys 


Asp 


Pro 


Thr 


Val 


Thr 


Val 








180 










185 










190 






Gin 


Leu 


Gin 


Asn 


Phe 


Lys 


Val 


Ser 


Val 


Leu 


Gly 


Glu 


Val 


Asn 


His 


Pro 







195 










200 






Glv 


Ser 


Met 


Ser 


Val 


Lys 


Gly 


Glu 


Arg 


He 


210 










215 








Gly 


Met 


Ala 


Gly 


Asp 


Leu 


Thr 


He 


Tyr 


Gly 


225 










230 








Glu 


Val 


He 


Arg 


Glu 


Thr 


Asp 


Gly 


His 


Arg 






245 










250 


Arg 


Lys 


Ala 


Asp 


Leu 


Leu 


Ala 


Ser 


Pro 


Val 




260 










265 




Asp 


Val 


He 


Tyr 


Val 


Glu 


Pro 


Asn 


Asp 


Lys 




275 










280 






lie 


Asn 


Gin 


Asn 


Asn 


Asn 


Val 


Asn 


Val 


Trp 




290 










295 








Leu 


Val 


Ser 


He 


Ser 


Thr 


Leu 


Thr 


He 


Thr 


305 










310 










(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 8 
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205 

Thr Leu Leu Glu Ala He 
220 

Arg Arg Asp Arg Val Phe 
235 240 
Val Ph Gin Thr Asp Leu 
255 

Tyr Tyr Leu His Gin Asn 
270 

Lys Thr Gin Met Ser Glu 
285 

Leu Ser Val Thr Ser Thr 
300 

He He Asp Lys Thr Lys 
315 320 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 446 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

Asn Pro Lys Lys Ala Leu Ala Leu Asn Leu Lys Gly Lys Pro Leu Pro 

1 5 10 15 

Glu Met Leu Ala Glu Pro Ala Gin Ser Pro Thr Tyr Ala Val Val Pro 

20 25 30 

ASP Asp Phe Glu Gly Val He Pro Lys Val Thr Ala Arg Pro Gly Asp 

35 40 45 

Lys val Arg Ala Gly Ser Ala Leu Met His His Lys Ala Tyr Pro Glu 

50 55 60 

Met Lys Phe Thr Ser Pro Val Ser Gly Glu Val He Ala Val Asn Arg 
65 70 75 80 

Gly Ala Lys Arg Lys Val Leu Ser He Glu Val Lys Pro Asp Gly Leu 

85 90 95 

Asn Glu Tyr Glu Ser Phe Pro Val Gly Asp Pro Ser Ala Leu Ser Ala 

100 105 110 

Glu Gin He Lys Glu Leu Leu Leu Ser Ser Gly Met Trp Gly Phe He 

115 120 125 

Lys Gin Arg Pro Tyr Asp He Val Ala Thr Pro Asp He Ala Pro Arg 

130 135 140 

Asp II Tyr He Thr Ala Asn Phe Thr Ala Pro L u Ala Pro Asp Phe 

Isp Phe He val Arg Gly Glu Glu Arg Ala Leu Gin Thr Ala He Asp 

165 170 175 

Ala Leu Ala Lys Leu Thr Thr Gly Lys Val Tyr Val Gly Leu Lys Pro 

180 185 190 

Gly Ser Ser Leu Gly Leu His Asn Ala Glu He Val Glu Val His Gly 



195 

Pro His Pro Ala 
210 

lie Asn Arg Gly 
225 

Val lie Gly Arg 

lie Ala Met Thr 
260 

Met Pro Gly Cys 
275 

Lys Glu Ser His 
290 

Lys Leu Cys Glu 
305 

Thr Val lie Pro 

Ala Pro Arg Leu 
340 

Leu Gin Gly Lys 
355 

Gly Glu Arg Ala 
370 

Met Asp lie Tyr 
385 

lie Asp Lys Met 

Phe Ala Thr Cys 
420 

lie Val Arg Glu 
435 



200 

Gly Asn Val Gly 
215 

Glu Thr Val Trp 
230 

Phe Leu Leu Thr 
245 

Gly Ser Asp Ala 

Asn Val Phe Ala 
280 

Glu Arg Val lie 
295 

Lys Glu Pro Phe 
310 

Glu Gly Asp Asp 
325 

Asp Gin Tyr Ser 

Asn Lys Glu Tyr 
360 

Met lie Met Ser 
375 

Pro Glu Tyr Leu 
390 

Glu Asp Leu Gly 

405 

Glu Phe Val Asp 

Gly Leu Asp Met 
440 
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Val Leu lie Asn 
220 

Thr Leu Lys Ala 
235 

Gly Lys Ala Asp 
250 

Ala Ala His Gly 
265 

Ser Phe Pro Gly 

Asp Gly Asn Val 
300 

Leu Ser Ala Arg 
315 

Val Asp Glu Leu 
330 

Met Ser Arg Ala 
345 

Val Leu Asp Ala 

Asn Glu Tyr Asp 
380 

Leu Lys Ala lie 
395 

lie Tyr Glu Val 
410 

Thr Ser Lys lie 
425 

Leu Tyr Lys Glu 



205 

His Thr Lys Pro 

Thr Asp Leu II 
240 

Phe Thr Arg Met 
255 

Tyr Val Arg lie 
270 

Arg Leu Thr lie 
285 

Leu Thr Gly Lys 

Cys Asp Gin lie 
320 

Phe Gly Trp Ala 
335 

Tyr Phe Ser Trp 
350 

Arg lie Lys Gly 
365 

Arg Val Phe Pro 

lie Ala Phe Asp 
400 

Ala Pro Glu Asp 

415 

Glu Leu Gin Arg 

430 
Met Asn 
445 



(2) INFORMATION FOR SEQ ID NO : 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

( A ) ORGANISM : Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION l..,329 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Asn Leu Arg Thr Gin Pro Phe Phe Leu Val Asp Leu Gin Ser Met He 

15 10 15 

Arg Thr II Leu Ser Arg Tyr Val Ser Ser Asn Phe Trp Ser Arg Gly 

20 25 30 

Ala Thr Phe Phe Ph Thr He Ph Pro Ala Phe He Leu Ala Ala Thr 

35 40 45 

Ala Leu Pro Ala Cys Gly Gly Gly Thr Ala Ser Gly Ser Asp Arg Thr 

50 55 60 

Leu Ala Val Thr He Glu Pro Gin Lys Tyr Phe He Glu Ser He Ala 
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70 75 80 

ASP Lys Ser Val Gin Val Val Ala Leu Val Pro Ala Gly Ser Asn Pro 

85 90 95 

Glu Glu Tyr Asp Pro Ser Pro Thr Val Met Lys Arg Leu Ser Glu Ala 

100 105 110 

ASD Ala Tyr Phe Tyr lie Gly Gly Leu Gly Phe Glu Gin Arg Asn Leu 

^ 115 120 125 

Ala Ala lie Arg Asp Asn Asn Pro Lys Leu Pro Leu Phe Glu Met Gly 

130 135 140 

Lys Ala Leu Ala Asp Ala Gly Ser Ala Asp Leu His Gly Ser Cys Thr 
145 150 155 160 

ASP His Ser His Thr Asp Leu His Ala His Asp Pro His Tyr Trp Ser 

165 170 175 

Ser Val Val Gly Ala Lys Ala Leu Ser Arg Ala Ala Tyr Asp Ala Leu 

180 185 190 

Val Glu Leu Tyr Pro Asn Glu Lys Asp Lys Trp Asp Lys Gly His Asp 

195 200 205 

Arg Leu Asn Gly Arg lie Asp Ser Val Lys Arg Leu Val Asp Thr Met 

210 215 220 

Phe Ala Asn Gly Lys Ala Asp Lys Ala Phe Val lie Tyr His Pro Ser 
225 230 235 240 

Leu Ser Phe Phe Ala Gin Glu Phe Gly Leu Arg Gin He Val He Glu 

245 250 255 

Glu Asp Gly Lys Glu Pro Thr Ala Ala His Leu Arg Arg Val He Asp 

260 265 270 

Gin Ala Arg Ala Asp Gly Val Arg He Val Phe He Gin Pro Glu Phe 

275 280 285 

Glu Thr Arg Gin Ala Glu Asp He Ala Arg Glu He Gly Ala Arg Pro 

290 295 300 

Val Arg He Asn Pro Leu Arg Ser Ser Trp Glu Glu Glu He Leu His 
305 310 315 320 

He Ala Arg Ala Leu Ala His Glu Arg 
325 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1-..365 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Cvs Cys Ala Arg Leu Arg Asp Thr Ser Pro Arg Lys Lys Ser Ph Arg 

15 10 15 

Asn Asp Thr Val Phe Pro Tyr Phe Val He His L u He Lys Hxs He 

20 25 30 

II He M t Thr Asp Asn Lys Gin Arg Asn He Val Phe Pro Ala Phe 

35 40 45 

Leu Leu L u L u Gly Val He Ala Val Val Thr He Val Gly Phe Phe 
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50 55 60 

Met Leu Arg Pro Ala Glu Glu He He Gin Gly Gin He Glu Val Thr 
65 70 75 80 

Glu Tyr Arg Val Ser Ser Lys Val Pro Gly Arg He Lys Glu Leu Arg 

85 90 95 

Val Ser Glu Gly Gin Gin Val Gin Ala Gly Asp Thr Leu Ala Val He 

100 105 110 

Glu Ala Pro Asp Val Ala Ala Lys Met Glu Gin Ala Lys Ala Ala Glu 

115 120 125 

Ala Ala Ala Gin Ala Gin Asn Ala Lys Ala Leu Lys Gly Ala Arg Ser 

130 135 140 

Glu Gin He Gin Ala Ala Tyr Glu Met Trp Gin Lys Ala Gin Ala Gly 
145 150 155 160 

Val Ala He Ala Thr Lys Thr His Gin Arg Val Gin Asn Leu Tyr Asp 

165 170 175 

Gin Gly Val Val Pro Ala Gin Lys Leu Asp Glu Ala Thr Ala Gin Arg 

180 185 190 

Asp Ala Ala He Ala Thr Gin Lys Ala Ala Glu Ala Gin Tyr Asn Met 

195 200 205 

Ala Arg Asn Gly Ala Glu Arg Glu Asp Lys Leu Ala Ala Ser Ala Leu 

210 215 220 

Val Asp Arg Ala Arg Gly Ala Val Ala Glu Val Glu Ser Tyr He Asn 
225 230 235 240 

Glu Thr Tyr Leu He Ala Pro Arg Ala Gly Glu Val Ser Glu He Phe 

245 250 255 

Pro Lys Ala Gly Glu Leu Val Gly Thr Gly Ala Pro He Met Asn He 

260 265 270 

Ala Glu Met Gly Asp Met Trp Ala Ser Phe Ala Val Arg Glu Asp Phe 

275 280 285 

Leu Ser Ser Met Thr Met Gly Ala Val Leu Glu Thr Val Val Pro Ala 

290 295 300 

Leu Asn Glu Glu Lys Val Arg Phe Lys He Thr Phe He Lys Asn Met 
305 310 315 320 

Gly Thr Tyr Ala Ala Trp Lys Ala Thr Lys Thr Thr Gly Gin Tyr Asp 

325 330 335 

Leu Lys Thr Phe Glu Val Lys Ala Thr Leu Ala Asp Lys Asp Lys Ala 

340 345 350 

Gin Lys Leu Arg Pro Gly Met Ser Val He He Arg Lys 
355 360 365 



(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME /KEY: misc_f atur 

(B) LOCATION 1. . .230 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
Thr Asn Lys Lys Glu Glu Thr Met Lys Lys Ser Ser Val Val Ala 



Val Leu Ala Val 
20 

Ala Lys Gly Gly 
35 

Gly Val Gly Asn 
50 

Thr Ala Val Gly 
65 

Lys Gin Lys Lys 

Thr Val Asn Asp 
100 

Leu Phe Ala Thr 
115 

Leu Thr Lys Phe 
130 

Arg lie Val Gly 
145 

Pro Leu Ser Glu 

Gin Gly Val Ser 
180 

Glu Pro Val Ala 
195 

Arg Val Glu Val 
210 

Gin Gin Gly Thr 
225 



5 

Ala Leu Val Phe 

Leu lie Gly Ala 
40 

Val Ala Gly Asn 

55 

Gly Ala Ala Gly 
70 

Glu Leu Glu Ala 
85 

Gly Glu Ala lie 

Asn Ser Ser Thr 
120 

Ala Ala Asn Met 
135 

His Thr Asp Asn 
150 

Arg Arg Ala Ala 
165 

Met Ser Arg Met 

Asp Asn Ser Thr 
200 

Tyr lie Leu Pro 
215 

Leu Lys 
230 



20 

10 

Ala Gly Cys Gly 
25 

Gly Val Gly Gly 

Thr Ala Val Gly 
60 

Ala Leu lie Gly 
75 

Ala Val Pro Asp 
90 

Leu Val Thr Phe 
105 

Leu Ser Pro Asn 

Asn Lys Asn Pro 
140 

Thr Gly Ser Asp 
155 

Ser Val Tyr Ser 
170 

Ala Ala Glu Gly 
185 

Val Ala Gly Arg 

Asn Ala Lys Met 
220 



15 

Leu Asn Asn Met 
30 

Ala He Gly Ala 
45 

Ala He Val Gly 

Lys Lys Met Asp 
80 

Ala Thr He Gin 
95 

Asp Ser Gly He 
110 

Ser Arg Thr Ala 
125 

Asp Thr Asp He 

Lys He Asn Asp 
160 

Phe Leu Asn Ser 
175 

Arg Gly Ser His 
190 

Ser Ala Asn Arg 
205 

He Glu Gin Ala 



(2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: aunino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATUItE: 

(A) NAME/KEY: ndsc_feature 

(B) LOCATION 1-..502 





(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 12 












Glu 


Gly 


Ala 


Ser 


Phe 


Val 


Leu 


Val 


Leu 


Thr 


Gly 


Lys 


Gly 


Phe 


Val 


Pro 


1 






5 










10 










15 




Ala 


Arg 


Ser 


Trp 


Glu 


Arg 


Val 


Asn 


Glu 


Ala 


Leu 


Glu 


Lys 


Asp 


Glu 


He 






20 










25 










30 






Val 


Lys 


Gly 


Xyr 


Val 


Lys 


Cys 


Arg 


Thr 


Lys 


Gly 


Gly 


Met 


He 


Val 


Asp 




35 










40 










45 








Val 


Phe 


Gly 


He 


Glu 


Ala 


Phe 


Leu 


Pro 


Gly 


Ser 


Gin 


He 


Asp 


Val 


Arg 




50 








55 










60 










Pro 


11 


Arg 


Asp 


Tyr 


Asp 


Ala 


Phe 


Val 


Glu 


Lys 


Thr 


M t 


Glu 


Phe 


Lys 


65 






70 










75 








His 


80 


11 


Val 


Lys 


He 


Asn 


Gin 


Glu 


Tyr 


Lys 


Asn 


Val 


Val 


Val 


Ser 


Lys 
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85 90 95 

Val Leu He Glu Ala Glu Leu Glu Gin Gin Lys Lys Glu He He Gly 

100 105 110 

Lys Leu Glu Lys Gly Gin Val Leu Glu Gly He Val Lys Asn He Thr 

115 120 125 

Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val Asp Gly Leu He His 

130 135 140 

He Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro Glu Glu He Val 
45 150 155 160 

Gin Leu Asp Gin Lys He Asn Val Val He Leu Asp Phe Asp Glu Asp 

165 170 175 

Arq Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met Pro His Pro Trp 

180 185 190 

ASP Ala Leu Asp Ser Glu Leu Lys Val Gly Asp Lys Val Lys Gly Lys 

195 200 205 

Val Val Val Met Ala Asp Tyr Gly Ala Phe Val Glu He Ala Gin Gly 

210 215 220 

val Glu Gly Leu He His Val Ser Glu Met Ser Trp Thr Gin His Leu 
225 230 235 240 

Arg ser Ala Gin Asp Phe Leu His Val Gly Asp Glu Val Glu Ala Val 

245 250 255 

He Leu Thr Leu Asp Arg Glu Glu Arg Lys Met Ser Leu Gly Leu Lys 

260 265 270 

Gin Leu Lys Pro Asp Pro Trp Ala Asp He Glu Thr Arg Phe Pro Val 

275 280 285 

Glv Ser Arg His His Ala Arg Val Arg Asn Phe Thr Asn Phe Gly Val 

290 295 300 

Phe Val Glu He Glu Glu Gly Val Asp Gly Leu He His He Ser Asp 
305 310 315 320 

Leu ser Trp Thr Lys Lys He Lys His Pro Ser Glu Phe Thr Glu Val 

325 330 335 

Glv Ala Asp He Glu Val Gin Val He Glu He Asp Lys Glu Asn Arg 

340 345 350 

Arg Leu Ser Leu Gly His Lys Gin Leu Glu Glu Asn Pro Trp Asp Val 

355 360 365 

Phe Glu Thr Val Phe Thr Val Gly Ser He His Glu Gly Thr Val He 

370 375 380 

Glu val Met Asp Lys Gly Ala Val Val Ser Leu Pro Tyr Gly Val Glu 
385 390 395 400 

Glv Phe Ala Thr Pro Lys His Met Val Lys Glu Asp Gly Ser Gin Ala 

405 410 415 

val Leu Glu Glu Lys Leu Pro Phe Lys Val He Glu Phe Asn Lys Asp 

420 425 430 

Ala Lys Arg He He Val Ser His Ser Arg Val Phe Glu Asp Glu Gin 

435 440 445 

Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys Ala Glu Ala Lys 

450 455 460 

Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn Pro Ala Gin Ala 
465 470 475 480 

Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu Ala Ala Leu Lys 

485 490 495 

Glu Lys Leu Ser Glu Asn 
500 



(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1 . . .307 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Met Asn Lys Tyr His Ser Gin Ser Val Leu Glu Val Gly Lys He Gly 

15 10 15 

He Val He He Phe Ala Pro He Val Arg Asn Val His Gin Gin Pro 

20 25 30 

Pro Phe Leu Ser His Lys Ser He Met Arg He Val Ser Asn Phe Leu 

35 40 45 

Phe Val Ser Phe Ser Val Leu Leu Phe Ala Ser Cys Arg Ser Gin Arg 

50 55 60 

Glu Lys Val Val Tyr Leu Gin Asp He Gin Thr Phe Asn Arg Glu He 
65 70 75 80 

He Ala Lys Pro Tyr Asp Val Lys He Glu Lys Asp Asp Val Leu Asn 

85 90 95 

He Leu Val Ser Ser Arg Asp Pro Glu Leu Ser Thr Pro Tyr Asn Gin 

100 105 110 

Val Leu Thr Thr Arg Ala Leu Ala Arg Asn Gly Tyr Gly Thr Asn Ser 

115 120 125 

Asn Glu Gly Phe Leu Val Asp Ser Lys Gly Tyr He Asn Tyr Pro He 

130 135 140 

Leu Gly Gin He Tyr Val Glu Gly Leu Thr Arg Thr Glu Leu Glu Lys 
145 150 155 160 

Glu He Gin Lys Arg He He Ser Ser Gly Phe He Lys Asp Pro Thr 

165 170 175 

Val Thr Val Gin Leu Gin Asn Phe Lys Val Ser Val Leu Gly Glu Val 

180 185 190 

Asn His Pro Gly Ser Met Ser Val Lys Gly Glu Arg He Thr Leu Leu 

195 200 205 

Glu Ala He Gly Met Ala Gly Asp Leu Thr He Tyr Gly Arg Arg Asp 

210 215 220 

Arg Val Phe Val He Arg Glu Thr Asp Gly His Arg Glu Val Phe Gin 
225 230 235 240 

Thr Asp Leu Arg Lys Ala Asp Leu Leu Ala Ser Pro Val Tyr Tyr Leu 

245 250 255 

His Gin Asn Asp Val He Tyr Val Glu Pro Asn Asp Lys Lys Thr Gin 

260 265 270 

Met Ser Glu He Asn Gin Asn Asn Asn Val Asn Val Trp Leu Ser Val 

275 280 285 

Thr Ser Thr Leu Val Ser He Ser Thr Leu Thr He Thr He He Asp 

290 295 300 

Lys Thr Lys 
305 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( iii ) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . -429 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Met Leu Ala Glu Pro Ala Gin Ser Pro Thr Tyr Ala Val Val Pro Asp 

15 10 15 

Asp Phe Glu Gly Val He Pro Lys Val Thr Ala Arg Pro Gly Asp Lys 

20 25 30 

Val Arg Ala Gly Ser Ala Leu Met His His Lys Ala Tyr Pro Glu Met 

35 40 45 

Lys Phe Thr Ser Pro Val Ser Gly Glu Val He Ala Val Asn Arg Gly 

50 55 60 

Ala Lvs Arg Lys Val Leu Ser He Glu Val Lys Pro Asp Gly Leu Asn 
65 70 75 80 

Glu Tyr Glu Ser Phe Pro Val Gly Asp Pro Ser Ala Leu Ser Ala Glu 

85 90 95 

Gin He Lys Glu Leu Leu Leu Ser Ser Gly Met Trp Gly Phe He Lys 

100 105 110 

Gin Arg Pro Tyr Asp He Val Ala Thr Pro Asp He Ala Pro Arg Asp 

115 120 125 

He Tyr He Thr Ala Asn Phe Thr Ala Pro Leu Ala Pro Asp Phe Asp 

130 135 140 

Phe He val Arg Gly Glu Glu Arg Ala Leu Gin Thr Ala He Asp Ala 
145 150 155 160 

Leu Ala Lys Leu Thr Thr Gly Lys Val Tyr Val Gly Leu Lys Pro Gly 

165 170 175 

Ser Ser Leu Gly Leu His Asn Ala Glu He Val Glu Val His Gly Pro 

180 185 190 

His Pro Ala Gly Asn Val Gly Val Leu He Asn His Thr Lys Pro He 

195 200 205 

Asn Arg Gly Glu Thr Val Trp Thr Leu Lys Ala Thr Asp Leu He Val 

210 215 220 

He Gly Arg Phe Leu Leu Thr Gly Lys Ala Asp Phe Thr Arg Met He 
225 230 235 240 

Ala Met Thr Gly Ser Asp Ala Ala Ala His Gly Tyr Val Arg He Met 

245 250 255 

Pro Gly Cys Asn Val Phe Ala Ser Phe Pro Gly Arg Leu Thr He Lys 

260 265 270 

Glu Ser His Glu Arg Val He Asp Gly Asn Val Leu Thr Gly Lys Lys 

275 280 285 

Leu Cys Glu Lys Glu Pro Phe Leu Ser Ala Arg Cys Asp Gin He Thr 

290 295 300 

val He Pro Glu Gly Asp Asp Val Asp Glu Leu Phe Gly Trp Ala Ala 
305 310 315 320 

Pro Arg Leu Asp Gin Tyr Ser Met Ser Arg Ala Tyr Phe Ser Trp Leu 

325 330 335 

Gin Gly Lys Asn Lys Glu Tyr Val Leu Asp Ala Arg He Lys Gly Gly 

340 345 350 

Glu Arg Ala Met He Met Ser Asn Glu Tyr Asp Arg Val Phe Pro Met 

355 360 365 

Asp He Tyr Pro Glu Tyr Leu Leu Lys Ala He He Ala Phe Asp He 
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375 380 

ASP I.ys Met Glu ASP Leu Gly He Tyr Glu Val Ala Pro Glu Asp Phe 



ila Thr Cys Glu Phe Val Asp Thr Ser Lys He Glu Leu Gin Arg He 

405 410 ^-L^ 

val Arg Glu Gly Leu Asp Met Leu Tyr Lys Glu Met Asn 
420 425 



390 



(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION 1...315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Met He Arg Thr He Leu Ser Arg Tyr Val Ser Ser Asn Phe Trp Ser 

Lg Gly Ala Thr Phe Phe Phe Thr He Phe Pro Ala Phe He Leu Ala 

Ala Thr Ala Leu Pro Ala Cys Gly Gly Gly Thr Ala Ser Gly Ser Asp 
35 40 45 



Arg Thr Leu Ala Val Thr He Glu Pro Gin Lys Tyr Phe He Glu Ser 

He ASP Lys Ser Val Gin Val Val Ala Leu Val Pro Ala Gly Ser 

70 75 



Isn Pro Glu Glu Tyr Asp Pro Ser Pro Thr Val Met Lys Arg Leu Ser 
85 

Glu Ala ASP Ala Tyr Phe Tyr He Gly Gly Leu Gly Phe Glu Gin Arg 

100 1*^^ 
Asn Leu Ala Ala He Arg Asp Asn Asn Pro Lys Leu Pro Leu Phe Glu 

Met Gly III Ala Leu Ala Asp Ala Gly Ser Ala Asp Leu His Gly Ser 

130 135 140 

cys Thr ASP His Ser His Thr Asp Leu His Ala His Asp Pro His Tyr 

irp ser Ser Val Val ^ly Ala Lys Ala Leu Ser Arg Ala Ala Tyr Asp 

165 170 '■'^ 

Ala Leu Val Glu Leu Tyr Pro Asn Glu Lys Asp Lys Trp Asp Lys Gly 

His ASP Arg ieu Asn Gly Arg He Asp Ser Val Lys Arg Leu Val Asp 

195 200 205 

Thr Met Phe Ala Asn Gly Lys Ala Asp Lys Ala Phe Val He Tyr Hxs 

oin 215 220 

Pro S r L u ser Phe Phe Ala Gin Glu Ph Gly Leu Arg Gin He Val 

He Glu Glu ASP Gly lJs Glu Pro Thr Ala Ala His Leu Arg Arg Val 

245 250 
H ASP Gin Ala Arg Ala Asp Gly Val Arg He Val Phe He Gin Pro 
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260 265 270 

Glu Phe Glu Thr Arg Gin Ala Glu Asp He Ala Arg Glu He Gly Ala 

275 280 285 

Arg Pro Val Arg He Asn Pro Leu Arg Ser Ser Trp Glu Glu Glu He 

290 295 300 

Leu His He Ala Arg Ala Leu Ala His Glu Arg 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

Met Thr Asp Asn Lys Gin Arg Asn He Val Phe Pro Ala Phe Leu Leu 

15 10 15 

Leu Leu Gly Val He Ala Val Val Thr He Val Gly Phe Phe Met Leu 

20 25 30 

Arg Pro Ala Glu Glu He He Gin Gly Gin He Glu Val Thr Glu Tyr 

35 40 45 

Arg Val Ser Ser Lys Val Pro Gly Arg He Lys Glu Leu Arg Val Ser 

50 55 60 

Glu Gly Gin Gin Val Gin Ala Gly Asp Thr Leu Ala Val He Glu Ala 
65 70 75 80 

Pro Asp Val Ala Ala Lys Met Glu Gin Ala Lys Ala Ala Glu Ala Ala 

85 90 95 

Ala Gin Ala Gin Asn Ala Lys Ala Leu Lys Gly Ala Arg Ser Glu Gin 

100 105 110 

He Gin Ala Ala Tyr Glu Met Trp Gin Lys Ala Gin Ala Gly Val Ala 

115 120 125 

He Ala Thr Lys Thr His Gin Arg Val Gin Asn Leu Tyr Asp Gin Gly 

130 135 140 

Val Val Pro Ala Gin Lys Leu Asp Glu Ala Thr Ala Gin Arg Asp Ala 
145 150 155 160 

Ala He Ala Thr Gin Lys Ala Ala Glu Ala Gin Tyr Asn Met Ala Arg 

165 170 175 

Asn Gly Ala Glu Arg Glu Asp Lys Leu Ala Ala Ser Ala Leu Val Asp 

180 185 190 

Arg Ala Arg Gly Ala Val Ala Glu Val Glu Ser Tyr He Asn Glu Thr 

195 200 205 

Tyr Leu H Ala Pro Arg Ala Gly Glu Val Ser Glu He Phe Pro Lys 

210 215 220 

Ala Gly Glu Leu Val Gly Thr Gly Ala Pro He Met Asn He Ala Glu 
225 230 235 240 

Met Gly Asp Met Trp Ala Ser Phe Ala Val Arg Glu Asp Phe Leu Ser 

245 250 255 

Ser Met Thr Met Gly Ala Val Leu Glu Thr Val Val Pro Ala Leu Asn 



260 

Glu <5lu Lys Val Arg 
275 

Tyr Ala Ala Trp Lys 
290 

Thr Phe Glu Val Lys 
305 

Leu Arg Pro Gly Met 
325 
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265 

Phe Lys lie Thr Phe lie 
280 

Ala Thr Lys Thr Thr Gly 
295 

Ala Thr Leu Ala Asp Lys 
310 315 
Ser Val lie lie Arg Lys 
330 



270 

Lys Asn Met Gly Thr 
285 

Gin Tyr Asp Leu Lys 
300 

Asp Lys Ala Gin Lys 
320 



(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: miscfeature 

(B) LOCATION 1 223 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Met Lys Lys Ser Ser Val Val Ala Ser Val Leu Ala Val Ala Leu Val 

15 10 15 

Phe Ala Gly Cys Gly Leu Asn Asn Met Ala Lys Gly Gly Leu lie Gly 

20 25 30 

Ala Gly Val Gly Gly Ala lie Gly Ala Gly Val Gly Asn Val Ala Gly 

35 40 45 

Asn Thr Ala Val Gly Ala He Val Gly Thr Ala Val Gly Gly Ala Ala 

50 55 60 

Glv Ala Leu He Gly Lys Lys Met Asp Lys Gin Lys Lys Glu Leu Glu 
65 70 75 80 

Ala Ala Val Pro Asp Ala Thr He Gin Thr Val Asn Asp Gly Glu Ala 

85 90 95 

He Leu Val Thr Phe Asp Ser Gly He Leu Phe Ala Thr Asn Ser Ser 

100 105 110 

Thr Leu Ser Pro Asn Ser Arg Thr Ala Leu Thr Lys Phe Ala Ala Asn 

115 120 125 

Met Asn Lys Asn Pro Asp Thr Asp He Arg He Val Gly His Thr Asp 

130 135 140 

Asn Thr Gly Ser Asp Lys He Asn Asp Pro Leu Ser Glu Arg Arg Ala 
145 150 155 160 

Ala Ser Val Tyr Ser Phe Leu Asn Ser Gin Gly Val Ser Met Ser Arg 

165 170 175 

Met Ala Ala Glu Gly Arg Gly Ser His Glu Pro Val Ala Asp Asn Ser 

180 185 190 

Thr Val Ala Gly Arg Ser Ala Asn Arg Arg Val Glu Val Tyr He L u 

195 200 205 

Pro Asn Ala Lys M t He Glu Gin Ala Gin Gin Gly Thr Leu Lys 
210 215 220 



^ij^) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1 . . .426 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Met Glu Phe Lys lie Val Lys lie Asn Gin Glu Tyr Lys Asn Val Val 

15 10 15 

Val Ser His Lys Val Leu lie Glu Ala Glu Leu Glu Gin Gin Lys Lys 

20 25 30 

Glu lie lie Gly Lys Leu Glu Lys Gly Gin Val Leu Glu Gly lie Val 

35 40 45 

Lys Asn He Thr Ser Tyr Gly Val Phe He Asp Leu Gly Gly Val Asp 

50 55 60 

Gly Leu He His He Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro 
65 70 75 80 

Glu Glu He Val Gin Leu Asp Gin Lys He Asn Val Val He Leu Asp 

85 90 95 

Phe Asp Glu Asp Arg Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met 

100 105 110 

Pro His Pro Trp Asp Ala Leu Asp Ser Glu Leu Lys Val Gly Asp Lys 

115 120 125 

Val Lys Gly Lys Val Val Val Met Ala Asp Tyr Gly Ala Phe Val Glu 

130 135 140 

He Ala Gin Gly Val Glu Gly Leu He His Val Ser Glu Met Ser Trp 
145 150 155 160 

Thr Gin His Leu Arg Ser Ala Gin Asp Phe Leu His Val Gly Asp Glu 

165 170 175 

Val Glu Ala Val He Leu Thr Leu Asp Arg Glu Glu Arg Lys Met Ser 

180 185 190 

Leu Gly Leu Lys Gin Leu Lys Pro Asp Pro Trp Ala Asp He Glu Thr 

195 200 205 

Arg Phe Pro Val Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr 

210 215 220 

Asn Phe Gly Val Phe Val Glu He Glu Glu Gly Val Asp Gly Leu He 
225 230 235 240 

His He Ser Asp Leu Ser Trp Thr Lys Lys He Lys His Pro Ser Glu 

245 250 255 

Phe Thr Glu Val Gly Ala Asp He Glu Val Gin Val He Glu He Asp 

260 265 270 

Lys Glu Asn Arg Arg Leu S r Leu Gly His Lys Gin Leu Glu Glu Asn 

275 280 285 

Pro Trp Asp Val Phe Glu Thr Val Ph Thr Val Gly Ser He His Glu 

290 295 300 

Gly Thr Val H Glu Val M t Asp Lys Gly Ala Val Val Ser L u Pro 
305 310 315 320 

Tyr Gly Val Glu Gly Ph Ala Thr Pro Lys His Met Val Lys Glu Asp 
325 330 335 
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ay Ser Gin Ala Val Leu Glu Glu Lys Leu Pro Phe Lys Val lie Glu 
340 345 350 

Phe Asn Lys Asp Ala Lys Arg lie lie Val Ser His Ser Arg Val Phe 

355 360 365 

Glu Asp Glu Gin Lys Met Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys 

370 375 380 

Ala Glu Ala Lys Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn 
385 390 395 400 

Pro Ala Gin Ala Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu 

405 410 415 

Ala Ala Leu Lys Glu Lys Leu Ser Glu Asn 
420 425 
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